# Multimodal OCR
Qwen2 VL OCR 2B Instruct GGUF
Apache-2.0
A multimodal model fine-tuned from Qwen/Qwen2-VL-2B-Instruct and optimized for OCR, image-to-text conversion, LaTeX math solving, and handwriting recognition.
Image-to-Text, Supports Multiple Languages
prithivMLmods
142
1
Olmocr 7B 0225 Preview
Apache-2.0
A document OCR model fine-tuned from Qwen2-VL-7B-Instruct, supporting multilingual document recognition and metadata extraction.
Text Recognition, Transformers, English
FriendliAI
322
1
Erax VL 7B V1.5 GGUF
Apache-2.0
A quantized version of EraX-VL-7B-V1.5 that supports Vietnamese, English, and Chinese and is suited to OCR and insurance-related tasks.
Image-to-Text, Supports Multiple Languages
mradermacher
190
1